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1. Introduction 



For any particularly interesting theorem one proof is never enough. Instead, the first 
■ proof sets the challenge to find a more elegant method that illuminates subtle features of 

the math, is simpler to understand, or even avoids using controversial subjects. In this 
paper we consider a subject that has attracted the attention of many mathematicians: the 
uncountability of the real numbers in the unit interval. We present the most exhaustive 
collection of proofs of this fact that we know. These range from Cantor's three published 
proofs, including his famous diagonalization method, to more recent proofs that employ 
measure theory, game theory, algebra, and analysis. 

2. Cantor's First Proof 

The first proofs of the uncountability of the unit interval were given by Georg Cantor 
(1845-1918). Cantor's research was motivated in part by his work in trying to prove 
the continuum hypothesis, a conjecture about the different sizes of infinity which he first 
formulated at the end of his 1878 article [7], see [131 P- 879]. If A is an infinite subset of 
the real numbers whose cardinality is not that of the natural numbers N, the continuum 
hypothesis states that A has the cardinality of the real numbers K. Although Cantor 
agonized over this open question for many years, he never successfully answered it — it is 
now known to be impossible to prove or disprove from the standard axioms of set theory 
m P-H8], [15]. 

Cantor defined two sets to be "equivalent" if "it is possible to put them, by some law, 
in such a relation to one other that to every element of each one of them corresponds one 
and only one element of the other" (we cite here the translation of his 1895 article in |11[ 
p. 86], though this definition already appears at the start of Cantor's 1878 [7] article, in 
which Cantor discusses the surprising fact that different dimension Euclidean spaces are 
equivalent; see [15] for a recent discussion of this proof ). Today we call this type of relation 
a bijection. We understand what Cantor called "equivalent" to mean the sets are the same 
size, or have the same cardinality. 

The most obvious infinite set is the set of natural numbers. We call sets that can be 
put into a bijection with the natural numbers countably infinite and understand these to 
be the smallest infinite sets. Sets that are finite (including the empty set) or countably 
infinite are said to be countable. If a set is neither finite nor countably infinite we call the 
set uncountably infinite or simply uncountable. Proving whether or not a set is countably 
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infinite is equivalent to proving whether or not it is possible to index the set by the natural 
numbers. 

Cantor published three proofs that show the set of real numbers is uncountable. The first, 
which we present below, appeared in an 1874 article proving the existence of transcendental 
numbers. Liouville, in 1844, had already proved that transcendental numbers exist, but 
Cantor's proof is different and he obtains, in a sense that can be made precise, that there are 
"more" transcendental numbers than algebraic numbers, We have modified this proof from 
Cantor's original notation so as to make it more accessible to the reader (for translations 
of the original paper see [21 pp. 620-622] or 13. pp. 839-843]). 

Theorem 2.1 (Cantor — 1874, [6]). If {u>k}keN is a sequence of distinct real numbers, then 
for every interval of real numbers [a, (5\ (a < f3), there is at least one r] in [a,/3] that does 
not occur in the sequence {wfcjfcgN- 

Proof. We will show that r/ exists by defining a sequence of nested closed intervals I n = 
[a n ,b n ], where a n and b n are elements in the sequence {w/c}, and choosing r\ in the inter- 
section of all the I n . 

Let k\ be the first natural number such that oj kl is in the interval (a,/3). (If k\ did not 
exist, any r] in (a,/3) would satisfy the assertion of the theorem.) Similarly, let &2 be the 
smallest natural number such that &2 > k\ and 0Jk 2 is in (a,/3). Then define 

a\ = min{uj kl ,uj k2 } and b\ = maxjw^ , u k2 } 

and let I\ = [a%, b\]. We next choose 02, 62 with 02 < 62 to be the next-indexed elements in 
{ojk} such that 02,62 are in (ai,b\) and let I2 = [02,^2]- We continue this process, so that 
if I n -i = [a n _i, h n -i] has been defined, then a n and b n are the two next-indexed elements 
of {cjfc} such that a n < b n and a n , b n are in (a n _i, b n -i). If the sequence of intervals that is 
generated is finite, suppose Ik is the last one. Then we can choose 77 to be any element of 
Ik, completing the proof. Therefore, we may assume the sequence of intervals is infinite. 
By construction, the intervals are nested in the sense that 

Jl D I 2 D ■ ■ ■ D I n ^ ■ ■ ■ . 

Consider the sequences {a^lfceN and {&fc}fc e fsj. We see the first is strictly increasing and 
bounded above by any b n and the second is strictly decreasing and bounded below by any 
a n . Therefore their limits exitst and we may define them as 

a 00 = lim a k and 6°° = lim b k . 

k— ¥00 k— ¥00 

(A modern reader would justify the existence of these limits by appealing to the monotone 
convergence theorem, a consequence of the BolzanoWVeierstrass theorem; Cantor states 
simply — in the translation of |24j — that "because they are strictly increasing in size without 
growing infinite, have a definite limit.") Since a n < b n for all n, then a°° < b°° and thus 
the interval [a 00 ,^ 00 ] is nonempty. Therefore we can choose a number r\ in [a 00 ,^ 00 ]. 

Finally we prove that rj is not equal to any of the elements in the sequence {c^}. Suppose 
it were, say rj = uj p , for some p > 1. We observe that u p cannot be in I p . Before the 
endpoints of I p were chosen, there were 2(p — 1) choices of points that were made from the 
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sequence. Therefore, if u p were in I p it would have to have been chosen already; but the 
endpoints of I p are chosen so that the interval is disjoint from all the previous choices. This 
shows that u p is not in the intersection of all the intervals. Therefore rj is a real number 
not in the sequence {u^}, completing the proof. 

□ 

Cantor's proof depends in a fundamental way on the Bolzano-Weierstrass theorem, 
or its equivalent, the monotone convergence theorem; according to Moore this theorem 
already appears in unpublished lecture notes of Weierstrass dating back to 1865, but it 
first appeared in print in an article by Cantor published in 1872 [19, P- 221]. Cantor uses 
the Bolzano-Weierstrass theorem to prove that a decreasing sequence of bounded closed 
intervals has a nonempty intersection. Basically using the same idea, with the aid of the 
notions of supremum and infimum of a set, one can obtain the following extension, which 
will be useful later: 

Theorem 2.2. Let {C n } n ^fq, be a sequence of closed and bounded sets satisfying 

Ci D C 2 D • • • D C n D • • • . 
Then their intersection Hn=i ^ n ^ s nonempty. 

Proof. Let a n = inf C n and b n = sup C n . Then a n and b n are in C n and a similar argument 
to that in the proof of Theorem 12.11 shows that their intersection is nonempty. □ 

As Cantor observes, Theorem 12.11 shows that a nontrivial interval cannot be mapped 
bijectively into a sequence, thus showing that the set of real numbers is uncountable. The 
paper, however, starts with a proof that the set of algebraic numbers is countable, and 
Cantor then mentions that his theorem implies that every interval contains infinitely many 
transcendental numbers. It is important to note, as has been argued by Gray |16| . that 
Cantor's theorem, in the case when the sequence {cok} consists of the set of algebraic num- 
bers, provides a constructive way of obtaining a transcendental number; in fact Gray uses 
this proof to give an algorithm for producing a transcendental number (we will come back 
to this after the diagonalization proof) . The last paragraph in the proof of Theorem 12.11 
shows that when the sequence {w^} is dense in the original interval, a°° = b°°, so in the 
case of the sequence of algebraic numbers, for example, a n converges to a transcendental 
number rj. 

In his letter to Dedekind of December 7, 1873, Cantor gives a more elaborate proof 
for constructing the number r/ than the one presented in Theorem 12.11 Two days later, 
on December 9, Cantor writes to Dedekind that he has found the simpler proof that he 
published in 1874 and that is essentially reproduced above (see (EH pp. 845-846] for 
translations of the letters). The reader may refer to Gray [161 p. 827] for a discussion of 
his unpublished proof. 

We will see that the end of the proof of Cantor's theorem is similar to the logic by which 
Borel arrives at a contradiction in his proof of the Heine-Borel theorem (Theorem 12. 3D . see 
[2j p. 225], whose first edition appeared in 1898. Borel uses Theorem 12. 31 in his development 
of the theory of measure on the line. 
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Theorem 2.3 (Heine-Borel — Special Case). If {I n }n>i is a sequence of open intervals 
that covers a closed bounded interval I = [a,b], then there exists a finite subsequence of the 
intervals {I nk }k=l that covers I. 

Proof. We may assume the open intervals are bounded by writing each unbounded interval 
as a countable union of bounded open intervals. Write I n = (a n , b n ) for n > 1. We describe 
an algorithm that will produce a finite cover. As in the proof of Theorem 12. H the ordering 
of the intervals according to their index plays an important role. We start by choosing 
a sequence of intervals according to their index. Let n\ be the first natural number such 
that a is in I ni and rename this interval J\. If J\ covers /, or equivalently if b is in Ji, we 
are done; if not pick the first interval containing the right endpoint of J\. Continue in this 
way. Again as before, this process generates a finite or infinite sequence. If finite we are 
done. We will show that if the sequence is infinite we arrive at a contradiction. 

The integer n\ has already been chosen such that a 6 (a ni ,b ni ). There exist a smallest 
integer n2 such that b ni S (a n2 ,b n2 ). Choose a smallest integer 77.3 in a similar manner. 
We are assuming we have generated an infinite sequence ni,ri2, ... , with b nk < b nk+1 < b 
for all k > 1. This is an increasing bounded (by b) sequence, so we can set 

b* = lim b nk . 

k— ¥00 

We know b* < b. So there exists an interval I p such that b* E I p . Write I p = (a p , b p ) is not 
one of the intervals I nk as b nk <b* < b p for all k > 1. 

The contradiction is obtained by noting that I p is a candidate to be chosen infinitely 
many times. Since b* is in I p there exists infinitely many k such that b nk is in I p . Then 
I p is a candidate when we choose the smallest integer n^ +1 such that b nk is in I nk+1 - As 
I p was not chosen this implies that infinitely many integers were chosen before p, a 
contradiction. □ 

It is interesting that Cantor and Borel's proofs are similar both in their construction 
of the sequence of intervals I n and I nk , respectively, and in their process of arriving at a 
contradiction. Cantor constructs I n by choosing endpoints a n ,b n to be the next-indexed 
elements of the sequence {u^} that are in the interior of the previous interval. Borel 
chooses his interval to be the next-indexed in the sequence that covers the right endpoint 
of the previous interval. Thus both I n and I nk are defined recursively by making use of 
the ordering of the original sequence. 

Both proofs arrive at a contradiction by showing that if the sequences were infinite, it 
would have to follow that there was an element of the sequences {u^} and {/ n } that had 
infinitely many elements preceding it. This contradicts the fact that the sequences are 
already ordered by their indices, thus whenever we pick out a specific element Uk or Ik we 
know only k — 1 elements can come before it. We observe now that the Bolzano- Weierstrass 
theorem can be obtained as a direct consequence of the Heine-Borel theorem. 

Theorem 2.4 (Bolzano- Weierstrass). If {a n } is a bounded sequence of real numbers, then 
it has an accumulation point. 
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Proof. Suppose the sequence is contained in the interval [a, b] and there is no accumulation 
point. Then there exists e > such that for each x in [a, b], the interval J x = {x — e, x + e) 
contains only finitely many elements of {a n }. As { J x } is an open cover for [a, b] and there 
is a countable number of points that are dense in [a,b], it follows that one can choose a 
countable subcollection {J Xk } that is an open cover for [a,b] (for each rational number 
in [a, b] choose k such that q^ is in (x^ — e/2,Xf, +e/2); since each x in [a,b] is e/2-close 
to some qk, the point x is in J Xk ). By the Heine-Borel theorem, there is a finite subcover, 
but this is a contradiction as the sequence is infinite. □ 

3. Cantor's Second Proof 

Cantor's second proof, published in 1884, uses the notion of a perfect set, which he had 
defined in 1883 as a set equal to its set of accumulation points [HJ p. 223]; or equivalently, 
a set that is closed and includes its set of accumulation points. As Moore remarks, Cantor's 
paper is also the first to give the definition of closed sets [19, P- 223]. 

In his proof, to construct a point that is not in a given sequence, Cantor uses Cauchy 
sequences, which he called fundamental sequences. The proof can be simplified, however, 
using Theorem 12.21 which we do here. For a proof using Cauchy sequences see [14J. Al- 
though Cantor states his theorem for subsets of R n , we continue to treat only subsets of 
the real line. 

Theorem 3.1 (Cantor- 1884, [8]). A countable set of real numbers cannot be perfect. 

Proof. Let P be a countable set, so we can write P = {p n }n&i- We will show that if we 
assume that P is perfect, then there is a point p 1 in P that is different from p n for all n in 
N, which is a contradiction, showing P could not be perfect. 

Let E\ > (for example, S\ = 1) and consider the interval B\ = {p\ — e\,p\ + £\); we say 
B\ is centered at p\ and of radius E\. As P is perfect, B\ contains infinitely many points 
of P. Then we can choose an open interval B>2 centered at a point of P that is in B\, and 
of radius £2 > such that B2 does not contain P2 and its closure B2 is included in B\. 
Again we know that B2 contains infinitely many points of P. Choose an open interval B3 
centered at a point of P that is in B2 and with radius such that B3 does not contain p% 
and its closure is included in B2. 

In this way we generate a sequence of open intervals {B n } n& fq such that B n contains a 
point of P, 

B n C B n+ \, 

and B n does not contain p n . 
Let 

c n = B n n p. 

Then the sets C n are closed and bounded, and decreasing: C n D C n+ \. Therefore there 
exists a point p' in their intersection Hn>i ^n- The point p' is in P and since it is in B n 
for all n > 1, it is different from p n . Therefore p' satisfies the properties we were seeking, 
completing the proof. 

□ 
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It follows that the unit interval is uncountable because [0, 1] is a perfect subset of the 
real numbers. 

The proof of Theorem 13.11 uses a similar method as that of Theorem 12.11 We again 
construct a sequence of nested closed intervals (the B n ) and examine the intersection of 
these intervals to find the element of P that is not in the sequence {uj^ } • But where in the 
first proof we constructed the I n by choosing endpoints that are elements of the sequence, 
here we choose the B n to exclude all € {a;^.}, i < n. 

Interestingly, the proof of the Baire category theorem follows almost exactly the same 
strategy as Theorem 13.11 The Baire category theorem states that "if X is a complete 
metric space, then the intersection of any countable collection of dense open sets in X is 
dense." Rene Louis Baire (1874-1932) proved this theorem in his doctoral thesis in 1899. 
Instead of using complete metric spaces we prove the theorem for closed subsets of R, as 
a subset of the line is complete (for the Euclidean metric) if and only if it is closed. For a 
proof of the general theorem see |23j. 

Theorem 3.2 (Baire category — Special Case). If F is a closed subset of R, then the 
intersection of any countable collection of dense open sets in F is dense in F. 

Proof Let {G n } n gN be a countable collection of dense open sets in F. Open in F means 
that each G n is of the form G' n nF for some open set G' n in R. To show that the intersection 
of the G n is dense in F we need to show that if B be any nonempty open ball in F (i.e., a 
set of the form B = {x G F : \x — xq\ < eo} for some xq in F and some £o > 0), then there 
exists a point rj in B that is in G n for all n in N. 

Since G\ is dense, there exists a point x\ in G\ (IB. Construct an open ball B\ centered 
at x\ with radius s\ < 1 such that 

Bi csnGi, 

(for example, choose E\ = Eq/2). Similarly, since G2 is dense, the set G2 H B\ contains a 
point X2- Construct an open ball B2 centered at X2 with radius £2 < ^ and such that the 
closure of B2 is contained in B\ nG^. Continue on to get the sequence of balls B n centered 
at a point x n of G n n B n _i with radius e n < such that 

B n C B n -\ n G n . 

The standard proof now proceeds to show that the sequence {x n } is a Cauchy sequence, 
and as the space is assumed complete, it must converge to a point we denote r\. In our 
case we will simplify the argument by using the additional structure of R. The sets B n are 
compact and nested 

B n D B n+ i. 

Therefore there is a point r\ in their intersection. This point must be in B, so the intersection 
n^Li G n fl B is nonempty, completing the proof. 

□ 

Now suppose the interval [0, 1] could be written as [0, 1] = UnGN{ Wn }- Taking comple- 
ments we obtain that HneNdP' ■"■] \ Wn}) is the empty set. But the sets [0, 1] \ {co n } are 
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clearly open and dense in [0, 1], and the Baire category theorem implies that their inter- 
section is dense in [0,1], so certainly not empty. Thus the unit interval is uncountable. 
More generally, this theorem can be equivalently stated as, ll F cannot be represented by a 
countable union of nowhere dense sets." (A set is nowhere dense if the interior of its closure 
is empty.) 

The Baire category theorem has had many important applications in analysis. It is used 
as a tool for proving existence results. Sets that are countable unions of nowhere dense 
sets are regarded as "small" and are called sets of first category. The complement of a first 
category set is considered "large" and we say elements of this set are typical. Proofs by 
that use the Baire category method not only show the existence of an element but show 
the typical behavior. For example, while the existence of nowhere differentiable functions 
was known following the example of Weierstrass, Banach proved in 1931 that the typical 
continuous function is nowhere differentiable (i.e., the complement set of functions that 
are nowhere differentiable is a set of first category in the set of continuous functions on the 
unit interval with the uniform metric). For a proof of this theorem and other applications 
of the Baire category method the reader may refer to |20j . 



Cantor's third and most famous proof, published in 1891, introduces his "diagonaliza- 
tion" method and is only four pages long. Cantor starts by referring to his 1874 paper 
where he proved that the real numbers cannot be put in a one-to-one correspondence with 
the natural numbers, and then proceeds to say that "it is possible to give a proof of that 
theorem without considering the irrational numbers" [9] (we have followed the translation 
in [131 PP- 920-922]). He goes on to prove that the set of all infinite sequences consisting of 
two symbols (Cantor uses m and w for the symbols) is not countable. We now introduce 
some notation. Think of {0, 1} N standing for the set of all functions from {0, 1} to N; each 
element a of {0, 1} N can be thought of as an infinite sequence consisting of 0s and Is. 

Theorem 4.1 (Cantor- 1891, [9J). The set {0, 1} N is uncountable. 

Proof. Suppose {0, 1} N were countable. Then there would exist a surjection 



giving an enumeration or listing of {0, 1} N . For each n in N, we have an element of 2 N 
denoted f(n). The value of the function f(n) at a natural number i is denoted f{n)(i). 
We now define a new element of 2 N . Define b : N -> {0, 1} such that 



Clearly b is in {0, 1} , so there must exits k in N such that f(k) = b. We observe that if 
f(k)(k) = 0, then b(k) = 1, and if f(k)(k) = 1, then b(k) = 0. Each case is a contradiction. 
Therefore there is no k with f(k) = b, contradicting that / is a surjection. It follows that 



4. Cantor's Third Proof 



/:N^{0,1} 



N 




{0, 1} cannot be countable. 



□ 
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Cantor does not give the details of how Theorem 14.11 implies that [0, 1] is uncountable, 
but one can provide such a proof by indentifying points in [0, 1] with their binary expansions 
as infinite sequences of 0s and Is. One needs to take some care as some points can have two 
such representations. Rather than considering these issues, we are going to give a slightly 
different form of this proof making use of the nested intervals theorem. This is, in fact, the 
first proof presented in [20j . where he also modifies it to give another proof of the Baire 
category theorem [20] . 

Proof. Suppose the unit interval was countable and consisted of the set {uJk}k&- We 
construct a number rj in the intersection of a certain nested sequence of intervals. Split the 
unit interval into the intervals [0, ^] and 1] and let I\ the interval that does not contain 
uj\. (If uj\ = |, then there is an ambiguity and we choose the left subinterval; one way to 
dispense with this ambiguity is to divided the unit interval into thirds instead.) Next split 
I\ into two equal-length subintervals and choose the one that does not contain to>2 (again, 
if there is an ambiguity choose the left subinterval). Continue in this way to generate a 
sequence of nested closed intervals 

Jl D h => " " " D In => " " " ■ 

Then there is a point r\ in the intersection of all these intervals (the point is in fact unique 
as the length of the intervals decreases to 0). Finally we observe that rj is different from all 
the ui n . It is clearly different from oj\ as 7] is in I2, an interval which does not contain oj\. 
Similarly we know that oj n is not in Jn+i, so it cannot equal 77. Thus we have constructed 
a point that is not in the enumeration, a contradiction. □ 

Cantor's diagonalization method can be very helpful in proving the uncountability of 
many sets. The two examples above are by no means the only ones. Raja has published 
another proof that uses the diagonalization argument but does not require the negation 
operation [21] . 

To obtain transcendental numbers using Cantor's diagonalization method one lists all 
algebraic numbers in binary expansion in the unit interval say, listing both representations 
for those numbers that have two such expansions (the dyadic rationals, i.e., those of the 
form n/2 k ). Then the diagonalization method gives the binary expansion of a transcen- 
dental number. Gray shows that this gives an algorithm that is more efficient than the 
algorithm resulting from Cantor's first proof. In addition, Gray proves that by this method 
all transcendentals in (0, 1) are obtained |16[ Theorem 3]. In this context one can ask for 
a listing of all the rationals in [0, 1] so that the algorithm yields a quadratic irrational, for 
example. For recent related results the reader may consult [18] . 

The diagonalization technique has been used in many important theorems. One appli- 
cation we mention is its use in proving that the halting problem (i.e., whether an abstract 
computer halts on a giving input) is undecidable. 

Cantor's 1891 paper may contain the first appearance of symbolic spaces, such as the 
space of all infinite sequences of 0s and Is. Did Cantor know that the space {0, 1} N is 
topologically equivalent (homeomorphic) to the middle thirds Cantor set (see section [7])? 
There is a natural metric defined on {0, 1} N . If x and y are in {0, l} N ,with x ^ y, let 
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I(x,y) be the first index i € N such that x(i) and y(i) are different and set 

and write d(x, x) = 0. So points get closer the more they agree from the start. With this 
metric {0, 1} N is a compact metric space that is perfect [23} p. 124], and where points are 
both open and closed, hence the space satisfies the property of being totally disconnected; 
it is known that these properties characterize a Cantor set, i.e., any two spaces satisfying 
them are topologically equivalent [271 P- 216]. 

5. Power Sets 

In his 1891 paper, Cantor also states that his diagonalization proof can be extended to 
prove that "for a given manifold L we can produce a manifold M whose power is greater 
than that of L" [13} p. 922], and he goes on to prove by the diagonalization method that 
there is no surjection from the unit interval to the set of all functions on the unit interval 
with values in {0, 1}, which can be identified with the set of all subsets of the unit interval. 
The power set, denoted V(X), of a set X is the set of all subsets of X. 

Another way to arrive at the uncountability of the unit interval is through power sets. 
If one examines the proof of Theorem 14.11 using the natural identification of subsets of N 
with sequences of 0s and Is that assigns i to the subset if and only if the ith element of 
the sequence is 1, ones can see that the subset of N that is represented by the sequence a 
is the set of all elements i in N such that i is not in f(i). This suggests the proof of the 
following theorem. 

Theorem 5.1. (Cantor) The cardinality ofV(X) is strictly greater than the cardinality of 
X. 

Proof. Clearly the cardinality of V(X) is greater than or equal to the cardinality of X. We 
proceed by contradiction. Assume that the cardinality of V(X) is equal to the cardinality 
of X. Then there must exist a bijection / : X — > V(X). Consider the set 

Y = {x e X\x i f{x)}. 

Y is clearly an element of V{X) and thus there must exist some y € X such that f(y) = Y. 
Is y G y? 

Suppose not. Then y is an element of X that does not map to a set containing itself 
and thus must be in Y by definition. This brings us to a contradiction. Suppose y is in Y. 
Then y is an element of X that does map to a set containing itself and therefore cannot be 
an element of Y by definition. We have a contradiction again. Therefore there is no valid 
bijection / that maps X to V(X). □ 

To use this theorem to obtain the uncountability of [0, 1] we first use the natural identi- 
fication of subsets of N with infinite sequences of 0s and Is; this establishes a bijection of 
P(N) with 2 N . Then as before we observe there is a clear identification of infinite sequences 
of 0s and Is with their binary representation in [0, 1] (being careful of the fact that the 
dyadic rationals have two representations). 
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It is hard to read the proof of Theorem 15.11 without think of Russell's Paradox. Suppose 

Y is a set consisting of all sets that that not elements of themselves, i.e., 

Y = {X\X <£ X}. 

Then we ask if Y is an element of itself. If Y G Y then by the definition of Y it follows 
that Y £ Y, a contradiction. On the other hand, iiY^Y then again by the definition of 

Y we would have that Y G Y, another contradiction. Thus we obtain a contradiction from 
the definition of this set. Thus Y cannot be a set. This paradox was published by Russell 
in 1903 and caused a re-examination of the foundations of set theory. The way it is now 
resolved is by noting that this unrestricted way of defining sets (such as the definition of 
Y), called the schema of comprehension, is not valid; Y as a class that is not a set. There 
is now a careful axiomatization for how sets are defined or constructed; one such system is 
called the Zermelo-Fraenkel axioms. 

6. Game Proofs 

In 2007, Matthew Baker [1] published a proof that essentially reconstructs the logic of 
Theorem 12.11 with a game (Baker also discusses other games, such as the Choquet game, 
which can be used to prove the Baire category theorem). The game is as follows. A subset 
S of the unit interval is given. There are two players, Alice and Bob, who take turns 
choosing elements of S. Alice goes first and chooses a number at such that < a\ < 1. 
Then Bob chooses b\ such that a\ < b\ < 1. They continue alternating and on each turn 
the player chooses a point between the previous two (i.e., on Bob's n th turn he selects b n 
such that a n < b n < 6 n -i; the choices are such that they generate a nested decreasing 
sequence of closed intervals [a n , b n ]). From the monotone sequence theorem we see that 
because {a n } is increasing and bounded above, it converges to a real number, denoted r\. 
Alice wins the game if r/ is in S, Bob wins if r] is not in S. 

Baker first proves the following theorem. 

Theorem 6.1. If S is a countable set, then there is a winning strategy for Bob. 

Proof. Let S be a countable set. If S is empty, then r\ is not in S and any strategy is a 
winning strategy. So assume it is nonempty and write it as S = {si}j<=N (we do not assume 
the terms are distinct). Alice starts and chooses < a\ < 1. Bob's strategy is to check if 
s\ is in the open interval (a\, 1). If so, he chooses b\ = s\, if not he chooses the midpoint of 
the interval (ai, 1). Then for the second move, if S2 is in the interval (a2, b\), Bob chooses 
62 = S2 ; and if not he chooses the midpoint of the interval. He continues in this way to 
generate a sequence of choices b n such that, independently of Alice's choices, for each n in 
N, the point s n is outside the open interval (a n , b n ). Since the sequences {a n } and {b n } are 
strictly increasing and decreasing, respectively, for every n £ N, a n < a < b n . Therefore 77 
is not in S and Bob has a winning strategy. □ 

If we let S be the unit interval, Alice is clearly guaranteed to win. Thus the unit interval 
is uncountable. 
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Mathematically this proof is very similar to Cantor's first proof. Alice chooses the left 
endpoint of each interval I n and Bob chooses the right endpoint. When Alice wins it means 
she has found rj which is not in the sequence {cok}- However this proof is much easier to 
visualize than the proof of Theorem 12.11 This gave us the idea to write Cantor's other 
proofs with a game argument. Here is the diagonalization proof: 

The game: Alice and Bob are given a set of numbers S C [0, 1] in their decimal expansion, 
and such that the expansion does not end in infinitely many 0s; for example, 1 is represented 
as 0.999 .... Alice and Bob construct a number rj in the following way. Alice first chooses 
a digit a\ in {0, 1, ...,9}. Then Bob chooses b\ in {0, 1, ...,9}. They repeat the process to 
find the second digit with ai and 62 and so on. The number they have constructed at the 
end of the process is O.Z1Z2Z3... where z n = a n + b n mod 10. Alice wins if the resulting 
number is in S, Bob wins otherwise. 

Theorem 6.2. // S is a countable set, then there is a winning strategy for Bob in the 
diagonalization game. 

Proof. Assume S is countable and nonempty and write it as S = {sj}ieN- Suppose the jth 
digit in the decimal expansion of s, is Sij. Whatever Alice chooses for a n Bob picks b n such 
that a n + b n = s n>n + 1 mod 10 if s ni „ + 1^0, and picks b n such that a n + b n = s n<n + 2 
mod 10 otherwise. Thus he guarantees that for each n > 1, the nth digit of n differs from 
the nth digist of s n . Therefore n is not in S and Bob wins the game. □ 

Again, as before, if S = [0, 1] then Alice is guaranteed to win. Therefore the unit interval 
is uncountable. We also note that if S is the set of all algebraic numbers, then Bob has a 
strategy to construct a transcendental number. 

Games have been studied in mathematics for some time; in particular two player games 
where the players alternate picking decreasing intervals are of the Banach-Mazur type 
and have been used to prove many results. Schmidt in |22| introduced Banach-Mazur 
type games where he proved, among other results, a theorem that implies the existence of 
uncountably many numbers badly approximable by rationals. 

7. Measure Proof 

Another interesting approach is to use results from measure theory to prove that the 
unit interval is uncountable. Informally we think of the measure of a set as its length. We 
define a set A in R to have measure zero if for each e > there exists a sequence of open 
intervals {/j} n eN such that 

00 00 
AC [J Ij and ^2\Ij\<e, 

j=l 3=1 

where \Ij \ denotes the length of interval Ij . Conceptually this tells us that a set has measure 
zero if we can cover the entire set with arbitrarily small intervals. 
We first prove the following: 

Lemma 7.1. Every countable set has measure zero. 
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Proof. Let A be a nonempty countable set with elements Xj,j £ N. Let e > 0. Define an 
open cover of A by setting 



2i+ 2 



Clearly \Ij\ = and 



oo 

e 



i=i i=l i=i 
Therefore j4 has measure zero. □ 

We next prove a lemma which implies that the unit interval does note have measure 
zero. It says that if [0, 1] (or any closed bounded interval) is covered by a countable union 
of open intervals, then the sum of the lengths of the sequence of intervals is an upper bound 
for the length of [0, 1]. It is intuitively clear but for its proof we will need the version of the 
Heine-Borel theorem we proved in Theorem 12.31 This lemma can also be found in Borel's 
development of measure [2]. 

Lemma 7.2. Suppose {Ij}j & fq is a countable collection of open intervals that covers I = 
[0,1]: 

It [jlj. 



Then 



|/|<El';l- 



Proof. From Theorem 12.31 we know that there exists a finite subcollection Ij k ,k = 1, . . . £, 
that still covers /. 

There exists an element of this subcollection, which we rename (ai,b\), such that a\ < 
< b\. If b\ < 1, there exists an element of this subcollection, which we rename (02,62) 
such that a2 < b\ < b<i- Finally we can find an element, which we rename (a p ,b p ), with 
p < £, such that a p < 1 < b p . Then, 

k 

^ J \ > (h - a x ) + (6 2 - a 2 ) + ... + (bp - a p ) 



> -a± + (61 - a 2 ) + (62 - a 3 ) H h (b p -i - a p ) + b 



p 



> b p — a p > 1 — = 1, 



since each term in parentheses is greater than zero, completing the proof. □ 

We can now show that the unit interval is uncountable; Stillwell [25] attributes this proof 
to Harnack (1885). 

Theorem 7.3. If I = [0, 1], then I does not have measure zero. 
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Proof. Consider e = \. Let {Ij} be any collection of open bounded intervals that covers 
[0, 1]. By Lemma 17.21 we know that X}jeN l-^'l — l-^l = 1 > e > thus we have that / = [0, 1] 
does not have measure zero. □ 

This tells us that countable sets have measure zero while the unit interval does not, thus 
the unit interval cannot be the same size as a countable set and must be uncountable. 
Notice that in Theorem 17.31 we have again used Theorem 12.31 The same argument works 
for any interval [a, b], a < b, in place of [0, 1]. 

In a certain sense measure theory makes the result more apparent. Once you are con- 
vinced that countable sets have measure zero (i.e., that you could cover up every element 
of a sequence with an arbitrarily small interval), then it seems obvious that it must be 
a smaller set than the unit interval which clearly has length 1. Be careful though, the 
converse is not true. There exist uncountable sets with measure zero. The most interesting 
example of such a set was defined by Cantor in 1883 (see [121 p. 329]) and is called the 
(middle thirds) Cantor set. It is defined on the unit interval by successively removing the 
middle third open subintervals of the closed intervals that are left. It turns out that the 
set that is left is not only nonempty but uncountable and has measure zero and is perfect, 
see e.g. [23]. 

8. Cauchy Sequences 

In 1969, B. R. Wenner [26] published yet another proof of the uncountability of the 
unit interval using Cauchy sequences. This proof is based on the construction of the real 
numbers from the rationals using Cauchy sequences of rational numbers, which was in fact 
the construction that Cantor proposed [12j P- 37]. Recall that a sequence {a n } n6 N is 
said to be a Cauchy sequence if for all e > there exists a natural number N such that 
I On — a m\ < £ for all n,m > N . It is clear that a convergent sequence satisfies the Cauchy 
property. A Cauchy sequence of rational numbers "wants to converge," but sometimes 
the number it would converge to is not rational. By identifying the limit of the sequence, 
which is a real number, with that particular sequence, we can construct the set of real 
numbers from the Cauchy sequences of the rationals. We must be careful because there 
exist different Cauchy sequences that converge to the same number. To avoid this we define 
an identification on the set C of all Cauchy sequences on Q by the equivalence relation ~ 
given by 

{in} ~ {b n } if and only if a n — b n converges to zero. 
Then a real number corresponds to an equivalence class of Cauchy sequences (a n ). Infor- 
mally we can state that x £ R is represented by {a n } if and only if {a n } converges to x. 
We can then write R = {(a n ) : {a n } E C}. 

Our notation works as follows: aj(n) denotes the n th term of the i th sequence. 

Theorem 8.1. The set R is uncountable. 

Proof. Assume the set of real numbers is countable and therefore we can write R = {otk '■ 
k € N}, where = (afc(n)}. Our strategy is to define a Cauchy sequence from the rational 
numbers, b(k) € Q such that {b} ^ (afc(n)). 
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We define two sequences inductively. The first, {Ni} for % € N, we take to be a strictly 
increasing sequence of natural numbers. We choose N\ such that for m, n € N, m, n > N\ 
we have 

|a m (l) -a n (l)| < ^. 
Assume N^-i to be defined and choose such that for all m,n > we have 

\a m (k) - a n (k)\ < pj^. 

We use {iVj} to define our second sequence, {&„}. Choose 6(1) 6 Q such that 

Assume to be defined and choose bk such that the following are satisfied: 

\h ~ h-i\ < T^y and \h ~ a Nk (k)\ > 

We now prove that {bk} is Cauchy and is thus an element of C. Fix e > and choose 
N such that < £■ Then if n > m > TV 

|&m + {b m +i — b m +i) + ••• + ipn-i — b n -i) — b n \ 
\(b m ~ b m +i) + (b m +i - b m+2 ) + ... + (&n-i - &n)l 

n 

J] l 6i - k+ll 
n 1 

1 

where the second to last line follows from the definition of {&} and the last line follows 
from m > N. 



\b m - b n 



< 



< 
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We now show that {b n } ^ (a k (n)). Let n > N k . {N k } is a strictly increasing function of 
integers, therefore N/. > k. It follows that 

\b n -a n (k)\ = \b n -b k + b k - a Nk (k) + a Nk (k) - a n {k)\ 

> \b k - a Nk \ - \a Nk (k) - a n (k)\ - \b n - b k \ 

1 1 n 

> o3fc+l ~ 93fc+2 ~ Z^Z \ b i- b i-±\ 



> 



> 



1 ^ 1 



=fc+l 



2 3fc+2 2 3 « 
i=fc+l 

oo 

£ 



23fc+2 2 3 W 

i=fc+l 
1 1 1 

~~ 23fc+2 ~~ 7 23fc 

This tells us the sequence {a n (k) — b n } cannot converge to zero, thus we have an element of 
C not included in any of the equivalence classes (a k (n)). Therefore our initial assumption 
was false and we cannot index the real numbers by N. □ 

This proof essentially shows that however we try to index the set of real numbers with 
N we are left with a postive distance between any two elements of our sequence. In the 
interval defined by these two elements we can always find a real number not included in 
our sequence. The construction of the reals from Cauchy sequences on the rationals is 
interesting in its own right, but complicates the proof, making this approach more difficult 
to follow than that of Cantor's original proof. 

9. Analytic Proof 

Perhaps the most unusual recent proof is one that Eliahu Levy [17J adapted from a proof 
by Bourbaki in [3]. This proof is difficult to follow because it introduces many new terms. 
We suggest the reader follow it with pencil in hand to keep track of these new terms and 
how they come into play. 

Our goal will be to define a function that relies on the fact that we are assuming the reals 
can be indexed by the natural numbers, and then show that this function gives rise to a 
contradiction. This will then prove that the reals cannot be indexed by the natural numbers 
and thus must be uncountable. Instead of using the Bolzano- Weierstrass property or the 
monotone sequence theory, this uses another completeness property of the real numbers, 
equivalent to any of these two, that is the least upper bound or supremum property. The 
supremum property states that if S is a nonempty set of real numbers that is bounded 
above, then there exists a number a in R such that (1) a > x for all x in S (i.e., a is an 
upper bound of S), and (2) for each e > there exists a number y in S such that y > a — e. 
This number is called the supremum of S and written sup 5. 

Theorem 9.1. The set IR is uncountable. 
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Proof. Assume that R is countable. Then we can write R as {x n } nS N. Define a function 
/ : R — s> R such that for all x G R, f(x) > 0, and for any finite subset F in R, the sum 

£/(*)<!■ 

An example of such a function, assuming R = {x ra } rag N, is f(x n ) = 2~ n for n in N. 
We need some preliminary definitions. For each nonempty set S in R define 

m(S) = sup < £ f(x) : F is a finite subset of S 1 > . 

[xeF J 

From the properties of / it follows that m(S) exists in R and is bounded above by 1. 
Finally, define the set 

A = {x <G R : x < m(— oo, x)}. 

Since A nonempty (0 is in A) and is bounded above by 1, it follows that it has a supremum 
so we set c = sup A Letting e = /(c) > 0, we know there exists y in A such that 

V > c- /(c). 

So c < y + /(c) and y < m (—oo,y). Also, as y is in A, y < c. Now, if i 7 is a finite set 
in (— oo,y + /(c)), it will consist of elements of (— oo,y) that do not include c, plus some 
elements of [y, y + /(c)) that may include c. Thus 

m(-oo, y + /(c)) > m(-oo, y) + /(c) > y + /(c). 

This means that y+/(c) is in A, but as y+/(c) > c, this contradicts that c is the supremum 
of A. Therefore our initial assumption is false and R is uncountable. □ 

Levy's proof is unlike any other in this paper. The use of the supremum shows that 
there are certain properties of functions defined over uncountable sets that do not transfer 
to functions over countable sets. An interesting exercise would be to explore some of the 
other properties of functions defined over the reals to see if they motivate new proofs. 

10. Conclusion 

The number of proofs of the uncount ability of the unit interval demonstrates that this 
important result has intrigued mathematicians ever since Cantor published his original 
proof. The variety of methods people have used illustrates the usefulness of this problem 
as an introduction to different areas of math. For instance, for the student new to set 
theory the game proof or Cantor's diagonalization proof are probably the best approaches 
to learning about uncount ability, but once it is a familiar result it can be the guide to 
the basics of measure theory, and analysis. In this way the uncount ability of the reals is 
important beyond its implications for different sizes of infinity: it links a familiar concept 
with possibly new ones. We challenge the reader to extend this result to more fields by 
finding new proofs that not only enhance our understanding of the structure of the unit 
interval but also serve as a guide to those areas of mathematics. 
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